Genetic object recognition using combinations of views

نویسندگان

  • George Bebis
  • Evangelos A. Yfantis
  • Sushil J. Louis
  • Yaakov L. Varol
چکیده

We investigate the application of genetic algorithms (GAs) for recognizing real two-dimensional (2-D) or three-dimensional (3-D) objects from 2-D intensity images, assuming that the viewpoint is arbitrary. Our approach is model-based (i.e., we assume a predefined set of models), while our recognition strategy lies on the recently proposed theory of algebraic functions of views. According to this theory, the variety of 2-D views depicting an object can be expressed as a combination of a small number of 2-D views of the object. This implies a simple and powerful strategy for object recognition: novel 2-D views of an object (2-D or 3-D) can be recognized by simply matching them to combinations of known 2-D views of the object. In other words, objects in a scene are recognized by “predicting” their appearance through the combination of known views of the objects. This is an important idea, which is also supported by psychophysical findings indicating that the human visual system works in a similar way. The main difficulty in implementing this idea is determining the parameters of the combination of views. This problem can be solved either in the space of feature matches among the views (“image space”) or the space of parameters (“transformation space”). In general, both of these spaces are very large, making the search very time consuming. In this paper, we propose using GAs to search these spaces efficiently. To improve the efficiency of genetic search in the transformation space, we use singular value decomposition and interval arithmetic to restrict genetic search in the most feasible regions of the transformation space. The effectiveness of the GA approaches is shown on a set of increasingly complex real scenes where exact and near-exact matches are found reliably and quickly.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Genetic Algorithms for 3D Object Recognition

We investigate the application of genetic algorithms for recognizing 3D objects from two-dimensional intensity images, assuming orthographic projection. The recognition strategy is based on the recently proposed theory of algebraic functions of views. According to this theory, the variety of 2D views depicting a 3D object under the case of orthographic projection can be expressed as a linear co...

متن کامل

Three-dimensional object recognition based on the combination of views.

Visual object recognition is complicated by the fact that the same 3D object can give rise to a large variety of projected images that depend on the viewing conditions, such as viewing direction, distance, and illumination. This paper describes a computational approach that uses combinations of a small number of object views to deal with the effects of viewing direction. The first part of the p...

متن کامل

Interpolation of Novel Object Views from Sample Views

In this article we address the problem of threedimensional object recognition from two-dimensional views. We use a viewer-centered model of object representation and interpolate novel views from stored sample views. The sample views are represented by graphs which are labeled with Gabor wavelet responses as local descriptors of object points. The positions of the object points in a novel view a...

متن کامل

Handbook of Pattern Recognition and Computer Vision, Pp. 863{882 Viewer-centered Representations in Object Recognition: a Computational Approach

Visual object recognition is a process in which representations of objects are used to identify those objects in images. Recent psychophysical and physiological studies indicate that the visual system uses viewer-centered representations. In this chapter a recognition scheme that uses viewer-centered representations is presented. The scheme requires storing only a small number of views to repre...

متن کامل

Recognition by Linear Combinations of Models

Visual object recognition requires the matching of an image with a set of models stored in memory. In this paper, we propose an approach to recognition in which a 3-D object is represented by the linear combination of 2-D images of the object. IfJLk{M1,.” .Mk} is the set of pictures representing a given object and P is the 2-D image of an object to be recognized, then P is considered to be an i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Evolutionary Computation

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2002